The article presents the Pico-Banana-400K dataset, which consists of approximately 400,000 text-image-edit triplets aimed at enhancing research in text-guided image editing. It features a variety of edit operations across multiple semantic categories, with evaluations conducted using advanced AI models to ensure high-quality edits. This dataset is designed to support both single-step and multi-turn editing applications.
dataset ✓
image editing ✓
text-guided ✓